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The Status of the Determination of ct(m z ) and ct s (m z ). 

Tatsu Takeuchi*) 

Fermi National Accelerator Laboratory 
P. 0. Box 500, Batavia, IL 60510, U.S.A. 

I will discuss the current status of the determination of a(mz) and a,(mz), emphasizing 
the pitfalls that one must avoid in performing statistical analyses. 

§1. Introduction 

Precise determinations of the values of a(mz) and a s (mz) are important in the 
context of both the Standard Model (SM) and Grand Unified Theories (GUT's) which 
are the two major themes of this conference. 

In the SM, the values of a(mz) and a s (mz) are necessary as inputs in calculat- 
ing the predictions for observables at the Z-mass scale. The accuracy in which they 
are known is reflected directly onto the accuracy of the SM predictions that must be 
compared with the precision electroweak measurements at LEP/SLC. Therefore, if one 
wishes to test the SM at the level of radiative corrections and/or detect virtual effects 
of new physics in the LEP/SLC measurements, one needs to know a(mz) and a s (mz) 
to very high accuracy. This point is well summarized in the talk by Hagiwara in this 
proceedings ^ . 

In the case of GUT's, the three gauge couplings of SUc(3) x SUl(2) x £/y(l) must 
coincide when evolved up to the GUT scale using renormalization group equations 
(RGE's). Whether they actually coincide or not depends on their initial values at 
s = m 2 z , and the particle spectrum contributing to the RGE's. Therefore, precise 
values of a(m z ) and a s (mz) will provide important constraints on the particle content 
and their masses for any given GUT 2 ) . 

In this talk, I will discuss the current status of the determination of a{m z ) and 
a s (mz). To avoid this talk from being a simple update of Ref. 3), I have shifted my 
emphasis somewhat to the problems in statistical analyses that one encounters when 
trying to extract these numbers from experimental data. 

For a(mz), I will give a comprehensive review of all the recent attempts to deter- 
mine its value from the a(e + e~ — > hadrons) data 4 -*" 10 - ) and clarify where these analyses 
differ from each other. In particular, I will emphasize the problems that must be ad- 
dressed in the treatment of the experimental data, and whether any of these analyses 
have succeeded in doing so or not. My conclusion here will be somewhat different from 
the one I gave at the conference, partly due to the fact that the details of the Swartz 
1995 analysis has been made available 10 ) and it can now be given a proper evaluation, 
and partly due to my better understanding of the problem. 

For a s (mz), I will not be reviewing all the different methods of determining a s (mz) 
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Fig. 1. Evaluations of a(mz) by different authors. All values have been rescaled to mz = 91.1887 
GeV and the top quark contribution has been removed. 

and how reliable they are, since thorough and excellent reviews on the subject by experts 
in the field already exist in the literature 11 )' 12 \ I will instead focus on the problem of 
determining a s (mz) from the Z-lineshape parameter Rg = r(Z — > hadrons) / T{Z — > 
£ + £~) at LEP and discuss an interesting (but erroneous) speculation of Ref. 13) which 
claims that the error on Rg, and thus a s (mz) determined from it, may be grossly 
underestimated based on an exotic (to particle physicists) statistical analysis. 

§2. The value of a(mz) 

The best determination of a(mz) until late 1994 had been that of Jegerlehner 5 ) 
from 1991 which was 

oT x {mz) = 128.87 ± 0.12. (2-1) 

As an example of how this uncertainly in a(mz) affects SM predictions, I quote the 
calculation of sin 2 f9* c ff pt based on this value using the FORTRAN program ZFITTER 4.9 
15 ) which gives 

sin 2 ^ e ff pt = 0.2319 ± 0.0003 (2-2) 

for the SM with m t = 180GeV and m H = 300GeV. The 0.1% error on a(m z ) has 
propagated directly into a similar error on sin 2 ^^*. On the other hand, the experi- 
mental error on sin 2 (9^* has been decreasing steadily. Currently, the averaged value 
of sin 2 from all asymmetry measurements at LEP and SLC is at 14 ) 

sin 2 #' c ff pt = 0.23143 ± 0.00028. (2-3) 

Clearly, an improvement on the knowledge of a(mz) is called for given that the experi- 
mental error is now smaller than the theoretical error, and can be expected to decrease 
even further. 

During the past year or so, several authors have independently made attempts 
to reevaluate the value of a(mz) through careful reanalyses of existing a(e + e~ — > 
hadrons) data 6 - ) ~ 10 ) . The hope was that the uncertainty in a(mz) could be reduced 
since some new data on cr(e + e~ — > hadrons) had been released 16 ^ " 18 \ and the 0(a^) 
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QCD correction to the cross section 19 ) together with a better determination of a s (mz) 
n )' 12 ) was now available. The resulting new values of a(mz) are shown in Fig. 1 
together with a couple of older values. 

Comparing the five new evaluations, we see that the four most recent ones are all 
more or less consistent with each other and the Jegerlehner '91 value with no significant 
decrease in the error. The only number which is slightly off is the Swartz '94 value 
from Ref. 6) which was updated by Swartz himself in Ref. 10). 

Due to this agreement between different evaluations, it does not really matter which 
of these four numbers is used as the standard value of a(mz) from a practical point of 
view. However, since we must decide on one number anyway, we may as well choose 
the one which can be considered to give the best estimate of the true value of a{mz)- 

In the following, I will review how the analyses shown in Fig. 1 differ from each 
other so that we may make an informed decision. But before I begin, I will first review 
the definition of a{mz) and how it can be determined from the a(e + e~ — ► hadrons) 
data so that we can have a clear understanding of the issues that must be addressed. 

2.1. The Definition of a(mz) 

The quantity whose value is usually quoted as that of the "effective QED coupling 
constant at the Z mass scale" is defined as 

a' x (mz) = cT x \Y- Aa(m z )\ (2-4) 

with 

Aa(s) = AnaRe [n' QQ (s) - n' QQ (0)] . (2-5) 
Here, TIqq(s) is the photon vacuum polarization function defined as 

'd 4 xe^(J»(x)r Q (0)) = i(q 2 g^ - q^)n' QQ (s) (2-6) 



where Jq(x) is the electromagnetic current modulo the coupling constant e. 

It has been customary to include only the light fermion contributions to Aa(mz)- 
The top contribution had been excluded from Aa(mz) because the top mass was un- 
known until quite recently 20 ** (though care is needed when comparing results since 
recent authors include it) and the W contribution was also excluded to keep the defi- 
nition of a(mz) gauge independent. They are also numerically small compared to the 
light fermion contribution since the W + W~ and ti thresholds are above the Z mass 
so that they do not contribute logarithms to the running of a(s) between s = and 
s = vn\. I will adhere to this convention in this talk and only discuss the contribution 
of light fermions. Readers who are interested in how the value of a(mz) will change 
when the ti and the W + W~ pinch contributions are included are referred to Table I of 
the talk by Hagiwara ^ . 

2.2. Contribution of the light fermions 

The contribution of the leptons to Aa can be calculated accurately in perturbation 
theory and one finds 

Aa leptons (m 2 z ) = Yl ^ -| + ^!-^(3-/9|)lnQ 



£=e,^,T 



1 + 
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Ea 

t=e,n,T 

= 0.03142, (2-7) 

where = \Jl — Am 2 /m 2 z . 

On the other hand, the contribution of the five light quarks (u, d, s, c, b) to Aa 
cannot be calculated perturbatively. Instead, unitarity and the analyticity of IIqq(s) 
is used to write 

.,(5) _ «*„ r 



as Z" 00 , . i? Is') , N 

(s = — P / <£,'— -L_> (2-8) 
V 7 3vr i 4m 2 s'(s - a') V J 

where 

3s 

i?(s) = 47rQ ,2( g ) °"( e+e " ^ 7* ^ hadrons) = -\2irlmn' QQ (s) (2-9) 

and the functional form of -R(s) is extracted from experiment. It is this reliance on the 
experimental values of R(s), which are currently accompanied by large experimental 
errors, that we end up with a relatively large error on a(mz) even though the fine 
structure constant a is known to extreme accuracy. 

2.3. Problems with using R(s) 

There are a couple of problems that must be addressed when using the experimental 
values of R(s) to calculate a(mz)- 

First, the data for R(s) are only available for discrete, scattered values of s. One 
must therefore interpolate between, and extrapolate beyond the available data points 
to make use of Eq. 2-8. 

Two methods have been used in the literature to deal with this problem. The 
first is to connect the data points directly with straight lines and perform trapezoidal 
integration 4 )' 5 ^' 8 ), and the second is to guess the functional form of R(s) and fit it to 
the data 4 )' 6 )' 7 )' 10 ). 

Both methods have their pros and cons. Trapezoidal integration is free from human 
prejudice about the functional form of R(s) but it is difficult to take into account the 
experimental errors properly: sparsely distributed precise data points may not get the 
appropriate weight relative to the densely spaced data points with larger errors, and 
possible correlations between errors are not accounted for. Connecting two data points 
that are far apart with a straight line will also introduce errors in regions where R(s) 
is changing rapidly. (A kind of 'human prejudice' in a sense.) 

On the other hand, fitting a guessed functional form to the data has the advantage 
that experimental errors are easier to take into account (though care is needed in 
treating normalization errors 21 ) as will be discussed later). However, the result will 
depend on the choice of the fit function and its parameterization, and systematic errors 
and biases will be introduced that are difficult to estimate. 

The two methods are often combined (e.g. fitting the Gounaris-Sakurai 22 ) form to 
the p 23 \ Breit-Wi gner forms to the narrow resonances, and using trapezoidal integra- 
tion for the continuum) and are supplemented by the use of perturbative QCD for the 
high energy tail of R(s). 

Secondly, the experimental values of R(s) are always accompanied by experimental 
errors, both statistical and systematic, and often rather large. 
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The systematic error consists of uncertainties in the measurement/monitoring of 
collider luminosity C, Monte Carlo calculation of the detection efficiency e, and radiative 
corrections 5 that are introduced when converting the measured event rate N into the 
cross section a through the relation 



It is therefore a normalization uncertainty. 

Obviously, such normalization errors on data points belonging to the same experi- 
ment are 100% correlated. This type of correlation of systematic errors within a single 
experiment is called a Type I correlation in Ref. 10). 

However, there can also be strong correlations between the normalization errors 
on data points belonging to two different experiments if they use the same collider, 
same knowledge of radiative corrections, event generators based on similar models, 
same background estimation methods, same detector response models, etc. This type 
of correlation of systematic errors between different experiments is called a Type II 
correlation in Ref. 10). Their exact size is difficult to estimate, but they nevertheless 
exist. 

Taking such correlations into account when extracting R(s) from data has impor- 
tant consequences as we will see later. 

2.4. Recent estimates of a(mz) 

Let us now start our tour of all the different analyses shown in Fig. 1 and see how 
they address the problems discussed above. The order in which they will be presented is 
not necessarily chronological, but what I deem appropriate to contrast the differences. 

2.4.1. The analysis of Burkhardt et al. 

At the time when LEP started running in 1989, the most accurate determination 
of Aa^ drons {m 2 z ) was that given by Burkhardt et al. in Ref. 4): 



the following three methods of interpolation 

1. Trapezoidal integration for the continuum and the p. 
Breit-Wigner forms for the narrow resonances. 

2. Trapezoidal integration for the continuum after a partial smoothing out of the 
R(s) data. (Ref. 4) does not give any detail on how the smoothing was done.) 
Breit-Wigner forms for the narrow resonances. 

3. Breit-Wigner forms for the p and narrow resonances. 

Linear interpolation in W = y/s for every few points in the continuum. 
In all three cases, perturbative QCD (at 0{a 2 s )) was used for R(s) above 40 GeV. The 
difference in Aa^ drons {m 2 z ) due to the difference in the method of interpolation was 
found to be negligible compared to the statistical error. 
Eq. 2-11 corresponds to 



N = aCe(l + $). 



(2-10) 




a {mz) = ol 



-i 



1 - Aa leptons {m 2 z ) - Aafl dTons (m 2 z ) 
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Fig. 2. Comparison of MARK I (black circles) and Crystal Ball (while diamonds) data. Only statis- 
tical errors are shown. The solid line shows the perturbative QCD result. 

= 128.81 ±0.12 (2-12) 

and results in an error of about 0.1% in a(mz)- The region below the bottom threshold 
contributed about 1/3 of Aa h( ^ drons (s), and 80% of the error. 

2.4.2. The analysis of Jegerlehner 

The result of Ref. 4) was subsequently updated in 1991 by Jegerlehner 5 ), who was 
one of the original authors, in which the data from the MARK I collaboration 27 ) in 
the energy region 5-7.4 GeV were replaced by the more accurate data from the Crystal 
Ball collaboration 28 ^. Fig. 2 shows the data of both collaborations in the energy range 
in question, together with the perturbative QCD result. The method of integration 
was the same as method No. 1 of Ref. 4) and the result was 

^Mdrons(m 2 z ) = 0.0282 ± 0.0009, (2-13) 
with no change in the size of the error. This corresponds to 

or x {m z ) = 128.87 ± 0.12 (2-14) 

which had been the standard value until late 1994. 

2.4.3. The analysis of Eidelman and Jegerlehner 

The 1991 Jegerlehner value was again updated by Eidelman and Jegerlehner in 
Ref. 8) to include new data 16 )" 18 ) in the analysis. Again trapezoidal integration was 
used. 

This reference goes into some detail on how experimental errors were taken into 
account. In order to prevent sparsely distributed precise data points from getting less 
weight than densely spaced data points with large errors, which is one of the problems 
inherent in trapezoidal integration, Eidelman and Jegerlehner used the following two 
methods: 

• When more than one experiment gave results in the same energy region, the contri- 
bution to Aa^ drons (m 2 z ) from that energy region was calculated using trapezoidal 
integration for each experiment separately, and then the resulting integrals were 
combined by taking a error weighted average. 
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• Data from different experiments in the same energy region were combined by 
taking local error weighted averages, and trapezoidal integration was performed 
on the combined data. 
Both methods were found to give similar results which is not too surprising since the 
local average for a point s was defined as the average of the R(s) values obtained by 
linear interpolation between the nearest data points. 

After subtracting out the top quark contribution from the value reported in Ref. 8), 
we obtain 

A 4Lons( m2 z) = 0-02804 ± 0.00065, (2-15) 

and 

a -1 (m z ) = 128.89 ±0.09, (2-16) 

which agrees very well with the previous estimate 5 ) . 

2.4.4. The analysis of Burkhardt and Pietrzyk 

The recent analysis of Burkhardt and Pietrzyk in Ref. 9) is another update of 
Ref. 4), but this time using method No. 3, i.e. linear interpolation in W = y/s for every 
few points in the continuum. 

In the 5 — 7.4 GeV energy range, an error weighted average of the MARK I 27 ) and 
Crystal Ball 28 ) data was used instead of discarding the MARK I data altogether as 
in Refs. 5), 6), 8), 10). New data from Refs. 16) -18) were also included. The resulting 
value of Aa n % rons (m 2 z ) was 

^hldronsiO = 0- 0280 ± O- 0007 ' ( 2 - 17 ) 

which corresponds to 

a^imz) = 128.89 ± 0.09. (2-18) 

2.4.5. The analysis of Martin and Zeppenfeld 

The analysis of Martin and Zeppenfeld in Ref. 7) distinguishes itself from all the 
other analyses in its extensive use of perturbative QCD. In table I, I list the energy 
ranges in which different authors have applied perturbative QCD. (Though I also list 
the values of a s (mz) that were used, at the current level of accuracy, the difference is 
insignificant. Changing the value of a s (mz) has little effect on the resulting value of 
a(m z ).) 

In the two energy regions 3-3.9 GeV and 6.5-oo GeV, Martin and Zeppenfeld 
express R(s) as the perturbative QCD value plus the J/&, \P, and T resonances. The 
DASP 25 ), PLUTO 26 ), MARK I 27 ), and Crystal Ball 28 ) data are all rescaled to fit the 
perturbative QCD result in these energy ranges, and the rescaled data is used to resolve 
a couple of resonances in the energy region between 3.9 and 6.5 GeV. See Fig. 3. 

This kind of collective rescaling of data is not as arbitrary as it may seem. The 
normalization errors reported by the experiments shown in Fig. 3 are 15% (DASP), 12% 
(PLUTO), 10~20% (MARK I), and 5.2% (Crystal Ball), respectively, so the rescaled 
data are well within 1 ~ 2a of their original values. If one believes that perturbative 
QCD is valid in this energy range, then it is quite natural to use it to fix the data to 
the 'correct' normalization. 
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Table I. Comparison of the reliance on perturbative QCD in the evaluation of a(mz) between different 
authors. 



Author 


Energy Range (GeV) 


Order in a s 


a s (m z ) 


Burkhardt et al. 4 )' 24 ) (1989) 


40-oo 


2 


0.12 ±0.02 


Jegerlehner 5) (1991) 


40-oo 


2 


0.117 ±0.010 


Swartz 6) (1994) 


15 — 00 


3 


0.125 ±0.005 


Martin & Zeppenfeld 7) (1994) 


3 - 3.9, 6.5 - oo 


3 


0.118 ±0.007 


Eidelman & Jegerlehner 8 - ) (1995) 


40-oo 


3 


0.126 ±0.005 


Burkhardt and Pietrzyk 9) (1995) 


12 - oo 


3 


0.124 ±0.020 


Swartz 10) (1995) 


15 - oo 


Q 
O 


0.116 ±0.005 



Due to the heavy reliance on perturbative QCD in this analysis, and consequently 
the relatively light reliance on experimental data, the uncertainty in Aa^ drons (m 2 z ) is 
reduced. The value quoted in Ref. 7) is 

A 4!drons( m2 z) = °- 02739 ± °- 00042 > ( 2 - 19 ) 

which corresponds to 

a^imz) = 128.98 ± 0.06. (2-20) 

2.4.6. The analyses of Swartz 

The analyses by Swartz in Refs. 6) and 10) fit a smooth function described by 
polynomials in W = y/s to the continuum part of R(s). Resonances were described 
by Breit-Wigner forms as usual, and perturbative QCD (at 0(a^)) was used above 15 
GeV. 

The Swartz analyses distinguish themselves as the first attempts to treat Type I 
and Type II correlations on a firm statistical basis. 

In his first analysis 6 \ Swartz used a standard technique in fitting his function 
Rfit(s;ak) to the R(s) data. Namely, the parameters a& which parameterized his 
function were chosen to minimize the x 2 defined as 

X 2 = ]T [Ri - R fit (si; a k )) Vr 1 [R 3 - R fit (s j; a k )} , (2-21) 

where Ri is the experimental value of the i-th measurement at Sj, and V^ 1 = (VjJ 1 ) is 
the inverse of the variance-covariance matrix V = (Vij): 

Vij = {ARi ARj). (2-22) 

ARi = R(si) — Ri is the difference between the true value of R(si) and the measured 
value Ri, and (O) denotes the expectation value of O. 

Following Swartz, I denote the uncorrelated point to point error on Ri as Gi(ptp) 
and the correlated normalization error as ai(norm): 

R(si) = Ri ± <Ji(ptp) ± Oiinorm). (2-23) 

Assuming that both Type I and Type II correlations are 100% when they are present, 
Vij is given by 

{°~i (ptp) + of (norm) i = j 
o~i(norm)aj(norm) i / j, with correlation (2-24) 
% j, no correlation 
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Fig. 3. The rescaling of data in Ref. 7). Only statistical errors are shown. 



Using this expression for Vij, Swartz obtained the fit shown with a solid line in Fig. 4. 
For comparison, Fig. 4 also shows the fit when all correlations are neglected with a 
broken line. 

The difference between the two cases is significant. When the uncorrelated fit was 
used, Swartz found that he reproduced the result of Ref. 5), in good agreement with 

Refs. 8), 9), and the observation of Ref. 4) that the value of Aa^d rons (rnz) * s relatively 
insensitive to the interpolation method. However, when he used the correlated fit, he 
found 

„(5) ,_2 ■ 



and 



^ILons^z) = 0-02672 ± 0.00075, 
aT l (mz) = 129.07 ±0.10, 



(2-25) 



(2-26) 



which differed from Eq. 2-14 by 0.20 « 2a. 

This analysis has been criticized 8 - ) on the grounds that including normalization 
errors in the correlation matrix will produce a fit which is biased towards smaller values 
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of R(s). This effect is beautifully explained in a very nice paper by D'Agostini 21 ). Since 
I feel that this is a very important point which everyone should know about, I have 
reproduced the basic argument in Appendix A. 

Swartz has subsequently updated his analysis to correct for this problem and also 
to include some data 29 ) which was missing from his first analysis 10 ). The new fit to 
the continuum part of R(s) is shown in Fig. 5, and was obtained by minimizing 

x 2 = y> [Ri ~ (1 + \j<*i)RMsi-, ak)? + y> A 2 (2 . 2?) 

where «j = ai(norm)/Ri, and \j are normalization parameters which account for 
collective rescaling of the data points in the j-th data set. (cf. Appendix A.) Using 
this new fit, Swartz obtained 

^Lonsiml) = 0.02752 ± 0.00046, (2-28) 

and 

a _1 (m z ) = 128.96 ±0.06. (2-29) 

The interesting thing about this result is that only 1/3 of the difference between this 
value and the previous one given in Eq. 2-26 is accounted for by the change from a 
biased to an unbiased fit. The remaining 2/3 of the shift is due to the inclusion of a 
single data point at W = 3.670 GeV from the Crystal Ball collaboration 29 ) which had 
a much smaller normalization error than all other data points in the vicinity. The effect 
of this point can be clearly seen by comparing Fig. 5 and the uncorrelated fit in Fig. 4. 
Due to normalization correlations, the entire fit function below W = 3.670 GeV has 
been pulled upwards and statistical fluctuations around this function suppressed. This 
accounts for the larger value of Aa^ drons {m 2 z ) and smaller error. 

2.5. Discussion 

In Refs. 4), 5), 8), 9) the difficulty of dealing with Type II correlations is discussed, 
but no strategy is presented as to how they are actually dealt with. Type I correlations 
are not even mentioned. Given that the uncorrelated fit of Swartz 6 ) reproduced the 
result of Refs. 5), 8), 9), I believe it safe to assume that none of these analyses took 
Type I or Type II correlations into account. The independence of the result on the 
interpolation method found in these works can also be understood as due to the neglect 
of normalization correlations. 

In the Swartz '95 analysis 10 ) , correlated data are collectively rescaled to obtain the 
best fit. Both Type I and Type II correlations are considered. The resulting fit function 
is quite different from the one integrated in the trapezoidal method of Ref. 8) as shown 
in Fig. 6. In my opinion, this analysis makes the best use of all the information available 
in the experimental data. However, the Swartz estimate is also susceptible to change 
drastically by the inclusion of new data. This is exemplified by the effect of the one 
data point from Crystal Ball at W = 3.670 GeV which 'fixed' the global normalization 
of R(s) below this point. If this point had not been included, then the estimate would 
have been much closer to the Swartz '94 value 6 \ Every time a new and precise data 
point is included in the analysis, the correlations will cause a ripple effect which may 
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Fig. 4. Fit to the continuum part of R(s) with (solid line) and without (broken line) correlations 
from normalization errors. The difference between the two in the 3.6-5 GeV region is due to a 
different treatment of the higher W resonances. The error bars on the data points show the total 
(statistical+systematic) error. 




100 



Fig. 5. The new fit by Swartz. The Crystal Ball data point at W = 3.670 GeV is hidden in the cluster 
of points in the vicinity. The error bars show the total error. 



shift the entire function upwards or downwards. Therefore, to be on the conservative 
side, the error should be doubled when using the Swartz estimate. 

The Martin and Zeppenfeld value 7 ) is obtained by using perturbative QCD to 
rescale the experimental data, so if one believes in perturbative QCD down to 3 GeV, 
then this would be a good value to use. One must remember, however, that theoretical 
prejudice has entered into the a(mz) value. The error is also artificially small due 
to use of perturbative QCD and cannot be justified by the data alone. To be on the 
conservative side, the error should be doubled in this case also. 

My conclusion is that the Swartz '95 analysis with doubled errors is probably the 
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Fig. 6. Comparison of the new fit by Swartz (broken line) and the trapezoidal integration of Eidelman 
and Jegerlehner (solid line). 

best estimate of a(mz) currently available. Namely, 

a' l {m z ) rs 129.0 ±0.1. (2-30) 

(Extra digits would be meaningless.) A better determination of a{mz) is impossi- 
ble with the current quality of R{s) data. Any further improvement requires better 
measurements of R(s) in the low energy region below 10 GeV 9 \ 



§3. The Value of a s (m z ) 

3.1. Current status 

In the case of a s (mz), there exist more than one way to determine it's value. 
Refs. 11), 12) give comprehensive overviews of the many ways to measure a s (mz)- 
In table II, I list some of the most recent determinations of a s {mz) using various 
techniques. 



Table II. Current status of the determination of 



a s (m z )- 


Method of Measurement 


a a (m z ) 


Deep Inelastic Scattering 12 ' 


0.112 ± 0.005 


GLS sum rule 12) 


0.108 ±0.008 


T decay into 3 gluons J0) 


0.108 ±0.005 


Voloshin 31 ' 


0.109 ±0.001 


Lattice 321 


0.112 ±0.007 


LEP 




Z lineshape data 14 ' 


0.125 ± 0.004 


R e only 14) 


0.126 ±0.005 


event shapes and jet rates 33 - ) 


0.123 ±0.006 


SLD 




event shapes and jet rates 34 - ) 


0.120 ±0.008 



As pointed out by Shifman 12 ), all 
the low energy determinations of a s (mz) 
are clustered around a s (mz) = 0.11 
while all the high energy determina- 
tions at LEP/SLC are clustered around 
otsipiz) = 0.125. This discrepancy, with 
a related discrepancy between the SM 
prediction and experiment in R[>, can be 
interpreted as a sign of new physics as 
discussed in the talk by Hagiwara 1 ). 

Another interesting possibility that 
was pointed out by Consoli and Ferroni 
in Ref. 13) was that the error on the LEP 
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Table III. R e values from LEP 1994 data. 





ALEPH 


DELPHI 


L3 


OPAL 


Re 


20.67 ±0.13 


20.96 ±0.16 


20.94 ±0.13 


20.90 ±0.13 




20.91 ± 0.14 


20.60 ±0.12 


20.93 ±0.14 


20.855 ± 0.097 


R T 


20.69 ±0.12 


20.64 ±0.16 


20.70 ±0.17 


20.91 ±0.13 



20.8 



Fig. 7. The Distribution of LEP 1994 Re data. The arrow indicates the weighted average. 



Fig. 8. The Distribution of LEP 1995 Re data. The arrow indicates the weighted average. 



value of Rt = T(Z — > hadrons)/T{Z — > £ + £~) may be grossly underestimated and that 
the actual error may be roughly five times larger. If that were the case, the error on 
the value of a s (mz) extracted from Rg will be enhanced also by the same factor and 
the discrepancy between the low energy and high energy determinations may vanish. 

Unfortunately (or fortunately, depending on your point of view) their argument is 
flawed. In the following, I will give an outline of the Consoli and Ferroni discussion 
and point out where their logic is problematic. 

3.2. The Argument of Consoli and Ferroni 

The argument of Consoli and Ferroni 13 ) was based on the Rg values from the 1994 
LEP data. In Table. Ill, I list the data for R e , and R T from the four LEP exper- 
iments. Neglecting small correlations and using the standard error weighted average 
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formula 



iTf I ^ 1 



x = 



the average of the 12 values listed in Table III gave 



£4' - 2 < ff i 

1 in Table III gave 

= 20.795 ± 0.040 (3-2) 



which corresponded to 

a s (m z ) = 0.126 ±0.006. (3-3) 

The observation of Consoli and Ferroni was that when the 12 data points were 
plotted in a histogram, it looked like Fig. 7 with two distinct peaks sandwiching the 
average value. This made them question whether the data points were normally dis- 
tributed around a common peak and whether it was correct to use Eq. 3-1. 

In order to answer this question, they calculated a quantity called the kurtosis 
defined by 

7 = M4/V2- 3 > O 4 ) 

where fj, n = J(x — x) n f(x)dx is the n-th moment of the distribution, and found it to be 
— 1.8. If the LEP data points were normally distributed, the probability of the kurtosis 
7 being this small is only 4 x 10~ 4 . Therefore, they concluded that the LEP R% data 
was not normally distributed and that Eq. 3-1 could not be used. 

They argue that in the absence of a sensitive way to combine the 12 measurements, 
the only thing that can be said about the true value of Re is that it must lie somewhere 
in the region 

20.60 <R f < 20.98 (3-5) 
which is obtained by visual inspection of Fig. 7. This corresponds to 

0.10 < a s {m z ) < 0.15 (3-6) 

which is perfectly consistent with the low energy value of a s (mz) = 0.11. 

The problem with this argument is simple: Eq. 3-1 is actually valid no matter 
what the distribution is as long as the data points are unbiased *) estimates of the true 
value. **) The kurtosis is a quantity which measures how spread out the tails of the 
distribution is compared to the normal distribution, and the fact that it is negative 
only means that the tails are less spread out than normal. (This is a typical symptom 
of LEP experiments since data points that are too far off from the SM prediction tend 
to get more attention and various corrections applied until the agreement is improved.) 
The kurtosis cannot distinguish whether there is one peak or two. 

If there are really two peaks, the separation must be due to systematic effects that 
are experiment and/or lepton flavor dependent. However, inspection of the numbers 
in Table III shows that the data points for each experiment and each lepton flavor are 
evenly distributed between the two peaks. There is no sign of any systematic effect 

*-* By 'unbiased' one means that (xi) = x. 

**' Determine the weights of a weighted average x = so that a 2 = {(x — x) 2 ) is 

minimized. One always finds Wi = 1/of and 1/ct 2 = Wi without making any assumption about the 
distribution. 
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so the separation into two peaks for this particular data set must have been purely 
statistical. This point is further supported by the latest LEP Rn data 14 ) in which the 
two peak structure has all but disappeared. See Fig. 8. 

Therefore, it is perfectly valid to use Eq. 3-1 to obtain Eq. 3-2, and hence Eq. 3-3, 
so the discrepancy in the low and high energy determinations of a s (mz) still persists. 

§4. Summary 

I have reviewed the current status of the determination of a(mz) and a s (mz)- 
For a(mz), I reviewed all the recent evaluations of its value and concluded that 

a -1 (m z ) = 129.0 ±0.1 (4-1) 

was probably the best estimate currently available. Further improvements require bet- 
ter measurement of R(s) at low energies below 10 GeV. 

For the a s (mz), I reviewed the claim of Ref. 13) that the value a s (mz) determined 
from R# should be assigned a larger error bar and concluded that the argument was 
erroneous. 
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Appendix A 

Correlations due to Normalization Errors 

In this appendix, I discuss the problem of performing a x 2 fit when normalization 
errors are present. To avoid unnecessary complications, I will only discuss the case of 
fitting a constant k to n data points Xi (i = 1, 2, • • • , n) which are all measurements of 
the same quantity x. Extension to the case of fitting with a function is straightforward. 

The standard method of fitting a constant k to n data points is to minimize the 
X 2 defined as 

x ^ = Y J {x l -k)V- 3 \x ] -k). (Al) 

i,j 

Here, V~ l = (V^ 1 ) is the inverse of the covariance matrix V = (Vij) whose elements 
are given by: 

Vij = {etej), (A-2) 

where (O) denotes the expectation value of O and is the difference between the the 
true value of x and the measured value Xi : 

x = Xi + ej. (A-3) 

It is assumed that the unbiased measurements of x, and that the a^s correctly 
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estimate the expectation value of e 2 , i.e. 



(d) = 0, (e?) = a, 2 . 



(A-4) 



When none of the ej's are correlated, the covariance matrix V and its inverse are 
diagonal: 



V ij = afS ij , \] 



-1 _ °ij 
a? 



and we find that the above definition of x 2 reduces to the familiar 



x 2 = E 



which is minimized at 



k — j — X. 



(A-5) 



(A-6) 



(A-7) 



Since (x) = x, we have an unbiased estimate of the true value x. 

In order to apply this formalism to the case where there is an overall normalization 
uncertainty in addition to uncorrelated point to point errors, I introduce a scale factor 
/ = 1 + €f, where (e/) = 0, and (e 2 .) = a"j, and modify the relation between x and Xi 
to 

x = f(xi + ej) = Xi + €i + €fXi + 0(e 2 ) 
The correlation matrix V is then modified to 



Vij = <(ei + e/Xi)(ej + e/Sj)> = °f + oJxjXj 



and its inverse to 



£7. 



1 



X% Xj 



1 



(A-8) 
(A-9) 
(A-10) 



The second term comes from the correlation due to the overall normalization error and 

,2 ; c 



goes to zero in the limit cxj — > 0. The % is then 



x 2 = E 



Xi - k 



E 



which is minimized at 



k 



x 



The expectation value of this quantity is 

(k) 



= k. 



(A-ll) 



(A-12) 



1 + (n - 1)(t^ 



(A-13) 
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so it is clearly biased towards values smaller than the true value x. In fact, (k) — > in 
the limit n — > oo. 

The source of this bias can be understood as follows. It is easy to show that 
minimizing the x 2 defined in Eq. A- 11 is equivalent to minimizing 



Obviously, the first summation term is minimized at / = k = 0, while the second term 
is minimized at / = 1. Therefore, as the number of data points increases, minimizing 
this x 2 has the effect of decreasing the first term by decreasing / at the expense of 
making the second term large. 

The problem with Eq. A- 14 is that only the data points X{ rescale with / and the 
errors <7j are left untouched. But since / is a normalization factor multiplying the errors 
also, they too should scale with /. Otherwise, the x 2 improves the agreement between 
data points by simply rescaling everything to zero! 

This problem can be corrected by defining the x 2 as 



In this case, the result is rather trivial since this x 2 1S minimized at / = 1, k = x. 
However, this definition can be easily extended to cases where there is more than one 
data set, each with its respective overall normalization error. In the extended cases, 
finding the minimum becomes a non-linear problem which is best solved by computer. 
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